Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sbarman25 's Collections
Training & Architectures
LLM Related
Med AI Papers
Datasets
Models
Safety / Alignment / Policies / SMI
Evals & Monitoring
Spaces
Agentic
Vulnerabilities
CV / Text-to-Image / Image-to-Image / Diffusion
Others
Hardware-aware Models
Text-to-nD++
Tool Usage (w/VLMs)
Vision Language Models
Audio Stuff

Others

updated Jul 25, 2024
Upvote
-

  • Masked Autoencoders Are Scalable Vision Learners

    Paper • 2111.06377 • Published Nov 11, 2021 • 3

    Note Papers (unrelated to above): 📰 Solving olympiad geometry without human demonstrations https://www.nature.com/articles/s41586-023-06747-5 https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/


  • Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

    Paper • 2311.00430 • Published Nov 1, 2023 • 58

  • distil-whisper/distil-large-v2

    Automatic Speech Recognition • Updated Mar 6 • 8.02k • 509

  • Seven Failure Points When Engineering a Retrieval Augmented Generation System

    Paper • 2401.05856 • Published Jan 11, 2024 • 2

  • ColPali: Efficient Document Retrieval with Vision Language Models

    Paper • 2407.01449 • Published Jun 27, 2024 • 48
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs